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Decision-making Rationale for Educational Testing 
(DRET) is a proposal intended to reduce the misuse of ^ achievement 
tests. It assumes 1 ) measurement is intended to facij-itate ^ 
decision-making, 2) the choice of an instrument, the identification 
of examinees and the use of test results is determined by the 
decision for which the measurement is to be taken, and 3) effective 
educational measurement is a function of the nature of the decision, 
the examinee* s bption to measure, and his expectation Of the 
measurement. DRpT specifies that a decision issue must be clearly 
stated and validated before initiating measurement, and that 
measurement in excess of that which facilitates the decision is 
prohibited. This proposal gives the student a ma^or role in 
determining measurement activities and might be summarized in the 
motto, "Test at Student Request," (Author) 
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Consider the possibility of a well'-enforced policy which stipulates 



that achievement tests may be administered only at the request of a 



student; written justification must be given for every measurement 
"activity conducted in the school; and confidentiality of test results 



must be insured. This would represent a major change in current 
measurement programs and possibly a solution to many related problems. 
There is discontent with educational measurement as evidenced 



by the numerous attacks which have been made on standardized tests. 

It is said that they discriminate unfairly; are culturally biased, 
are products of money-making organizations, are an :xnvasion of privacy^ 
are a cause of unwarranted frustration and anxietji:^ and are outdated 



and irrelevant (Gross, 1962; Hoffman, 1961; Yourrn^, 1970). Nongraded 



programs have been introduced in an attempt to de-emphasize levels of 






achievement (Rhoades, 1966; Perkins, 1961; Goodland & Anderson, 1959). 

The pass-fail option, and the use of a Satisfactory-Unsatisfactory 
scale were initiated to min iinlze distinctions in performance (Sgan, 

1969). Parent-teacher conferences as oppibsed to the-rquarterly or 
six- week report cards , are another attempt at eliminating the quantifying 
and categorizing of students. A ^^good teacher*^*^ tends to be equated 
1 with one who avoids tests, de-emphasizes grades, and re j^rains : from 

-vV using noi^ .(Rogers, 1961).- V'- - 

acklash against our scientific passion for precision? 




Is this a backlash against 

Or is measurement the most vulnerable spot in education at ; a 
when .protest appears ■ ’ • ■ - -r._ 

the last; test we 



ppears to be in style? Perhaps if we step away from 

■ ’'V> V ."T-v \ V:'’ '’-.V 

we took, the most recent exam? we prepared, or ?the U 
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report card we just signed, it will be possible to get an objective 
view of measurement, and identify more accurately the nature of the 
problem of educational measurement. 

Measurement is an acceptable and respectable activity in the 
physical and domestic sciences. The refinement of measurement has 
usually been recognized as a mark of progress, and the limitations 
of an instrument frequently supplied the motivation for such progress. 
Impatience with imprecision resulted in the perfection of instruments 
more frequently than in their elimination. Dissatisfaction with 
the sun dial, for example, did not lead to a disregard for the time of 

day, but rather to the invention of precision time pieces. 

Educational measurement, however, has enjoyed comparatively 
little prestige or progress. The limitations of testing programs 
are vigorously discussed, but there has been a shortage of con- 
structive criticism. Perhaps we lack the patience that marked pro- 
gress in measurement requires; perhaps we doubt the need for educa- 
tional measurement. But if we are intent on improving education, 
we must resolve our problems with measurement . For improvement or 

progress without some form of measurement is questionable. 

The purpose of this^ '^ 

fo7r Educational Testing (DRET) which would provide a guide for developing 
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factor of option which are believed to be significantly related to 
the effects of educational measurement. After briefly exemplifying 
the role of these three factors (i.e., decision, expectation, and 
option) in noneducational measurement, I will discuss their function 
in educational measurement. 



Ordinarily an individual has a definite reason for using a 
tape measurer, for checking the outdoor thermometer, or for computing 
mileage on a trip. It is the reason which justifies one's reading, 
recording, adding, subtracting or comparing of numbers. A person 
records mileage, not only to satisfy curiosity, but to decide what 
figures to write on an expense account, or which of two routes is 
the shorter, or whether a recent tune-up improved his mileage. A 
pending deci.sion motivates the act of measuring. 

Accompanying the need for a decisioi. there is usxially a factor 
of expectation which affects the satisfaction derived from tneasure- 
ment. It is more enjoyable for an individual to test his tennis 
skill when he expects improved performance than when he is simply 
asked to serve as a practice partner. The task of meticulously 



recording experimental data becomes less boring and possibly 
exciting when an individual anticipates significant, effects . 
Expectation reflects a personal interest that makes measurement 
meaningful . 



A third characteristic of most measuring activities is that of 
option 
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L. -'We are\free'' ;i^',.t^e''orf'not^fake;'a:-^ ^ 

iregard a speedometer; to step on of over a scale; to measure , ' 

or, guess at:-the..,an^^Jog^aint;fh|^ ' 

limits'.,' restricH'6risi'''"ahd':‘';'cbhs'equencasV;(^^^ 
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most cases, but an individual is generally free to measure or avoid 
measurement . 

What decisions, expectations, and options are involved in 
educational measurement? At present, the answer to this question 
leaves much to be desired, I suggest that a major reason for 
educational testing is administrative routine: It is time to give 

report cards and evidence is needed in case of student or parent 
objections. It is an odd numbered year and IQ tests are given 
alternately with achievement measures. The Title I Project Report 
requires pre- and posttest comparisons. The school board wants the 
annual results of the city-wide testing program. Funds for a 
physical fitness program have to be justified. 

What decisions are at stake? How important are they? For 



whom are they important? A teacher can decide which, of her pupils 
will get A's and who will be given the B's, C's, etc. The test results 
from the Title I program may have an effect on future funding. The 
results of the city-wide testing may lead to a decision regarding -h 
of several experimental programs should be given preference for the 
coming year. But, these types of decisions are administrative. 
Meastirement is in^kosedi upon students so teachers and administrators 
can ^solve their gpEobleans or satisfy their responsibilities . The 



decision power is not an the hands .nf the individual most immediateliy 
responsihle.rfor efae. measurement. The nature, of J?. the decision, is often 
of little , or n<* intereiffit to the staiHent. The ' decisibh process seldom ' ; 
involves him th©- 'poxntiiof ' dnta. ga.th©irihg*^ 

The option faaE,tor^ :xs virtual ly nonfunctional in the^ measurement 



act±vitie|S of lo^r ^schooHs • . Ai^studfetit way arrah^^ ^9 

ERJC^ ,. 
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an exam period, but aside from such a manuever he has little or no 
choice about when, -how, or on what content he is to be measured. 

He has no right to say, ^^X^ll slcip the standardized tests this year, 
nor does he enjoy the freedom of saying, ”I'd like to measure my 
science and math achievement, for I'm considering a course in 
physics." 

The factor of expectation and its effect on educational measure- 
ment is difficult to assess. It seems reasonable, however, that 
expectation will be more specific on freely selected tasks than on 
imposed tasks. If expectation functions as a goal-setting motivator 
(De Cecco, 1969), intensifying student expectation can be expected to 
improve test performance, and should, therefore, be one goal in 
designing a testing program. 

In summary, measurement programs as they currently exist in our 
schools are not for purposes which the student defines; he is not a 



decisi-^n-maker; there is little reason to believe he has strongly 
motivated expectations; and he has probably never enjoyed less option. 
Precision instruments would probably be of little va:lue under these 
circumstances, and the most crudely designed could not be held ' 
accountable for the resulting discontent. Thus, a resolution of our 
educational measurement dilemna requires more than a technical dis- 
cussion of instruments, It requires a re-evaluatioa and reformulation; 
of our rationale and a corresponding modification of our testing 
policies # 



DRET is not an elaborately ^delineated program, iiready for adoption. 




3 .nd two ground iruXss • It assumes 1) measurement is intended to faciXi™ 
tate decision making; 2) the choice of an instrument, the identification 
of examinees, and the interpretation and use of resuXts is determined 
by the decision for which the measurement is to be taken; and 3) effective 
educationaX measurement is a function of the nature of the decision, 
the examinee’s option to measure, and his expectation of the measure- 
ment. The ground ruXes are: 

X. A decision issue must be cXearXy stated and vaXidated before 
measurement is initiated. 

2. Measurement in excess of that which directXy faciXitates 
the vaXidated decision-making activity is prohibited. 

A Xook at current educationaX practices in view of these ruXes 

cXearXy emphasizes the excessiveness of our measurement activities; It 
is difficult to imagine a decision which would require the routine 
testing of every student at a given level over aXX major areas of 
content. For a system-wide curriculum decision, one would only need 
to measure a random sample of students over just that content relevant 
to the decision issue. A decision concerning the individual student 
would require testing the one individual for whom a decision is to 



be made. It is inconceivable that a school system makes an annuaX 
decision about ^ grader on every major subject 

during the month of October or Npyember I . ' ^ 

To envision a DRET-based testing program, suppose that a school 
system had a coXXec^tion of measufeimient^ihstrumeh^ ; 

This might be a simple compilation in a filing cabinet or a soph 






uoinputerized ' bank i:;-.':;ThevXibr;ary;v includes vac^ 

ui)iii ty,rin^f r^ent s;. ^ At ^ 
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sign that reads, *'TEST AVAILABLE AT EXAMINEE REQUEST ONLY!” Aside 
from a few rare exceptions, no instrument is ^checked out” unless it 
is requested by the examinee for a specific and justifiable purpose 
implying some decision. Only that part of the instirument related to 
the decision issue is loaned and provisions for its supervised use are 
made. The test results are personal information to be revealed or 
shared only at the discretion of the examinee (or parents in tha caae of 
a very young child) . 



Under such a system the examinee’s option to measure would be 
guaranteed. Requiring the specification of the decision for which 
measurement is to be used, would help insure test , validity and maximize 
student effort. The expectation factor would undoubtedly be operative, 
since the examinee not only can but must determine when, why, and on 
what content he will be measured. Testing could become a measure- 
ment activity which is under the immediate control of the examinee- - 
assuming the examinee can be motivated to measure his academic 
achievements! 

The motivation required to operationalize a ”test at student 
request” program could be a challenge. It is obvious that students 
will have to be given incentives-«acceptablereasons--for taking tests. 
But, then, An improved rationale is desired hot only by students, but 
teachers and administrators as well. I propose that an emphasis on 



pretesting as opposedi .to posttesting would facilitate a , DRBT"base^ . o: , ■ 

•measurement ''pr^^ ‘ "v/"' ’ I", 



Few academic e^^deavcrs airjS valuable in themselves. They usually 



acquire importance only in 'view of future . activities . 
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similar activity requiring the application of shorthand I is desired. 

On the other hand, the course content, the teaching method, and the 
class composition usually assumes a given level of achievement which 
supposedly helps insure success. Thus emphasizing conditions for 
course enrollment (i.e., stating requisites and prerequisites) would 
have greater relevance than emphasizing conditions for successful 
course completion. Test taking vrould become a means by which students 
would assure themselves and the instructor that they were prepared 
for a given learning experience. 

Furthermore, if prerequisites are expressed in terms of achieve- 
ment levels rather than completed courses or specific experiences, 
the means to achievement can be left flexible. The student can simply 
be held accountable for providing evidence of achievement. A shift 
of emphasis from post-measurement to pre-measurement would offer improved 
guidance for course selection as well as protection against failure. 

The use of requisites and prerequisites might also be used to provide 
incentives for attemding to less desirable learning activities. For 
example, the student whose primary interest is shop and whose least 
desirable subject is reading might discover that the requirements for 



the first shop class are; 

1. To denionstrate at least a 4th grade level of reading 
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2. To have successfully completed one course in art design. 

3. To have demonstrated at least a 5th grade level of 
reading ability. 

4. To carry simultaneously a minimum of two additional 
subjects in which satisfactory performance is demonstrated. 

By the time the student reaches the 4th or 5th shop course the require- 
ments might include "having an academic program that can terminate in 
a diploma or certification within 12 months," Students (with varying 
degrees of counseling) would decide upon a program or course, and attempt 
to demonstrate their qualifications (t 3 q>ically by providing test scores) . 
Substandard performance (i.e., below prerequisite level) on an instru- 
ment would simply suggest to an examinee that additional knowledge or 



skill is needed prior to attempting his selected course. The individual 
who "measured up" to a course would request that his score be for- 
warded to the appropriate teacher, and thus earn admission. Students 
would test into courses rather than out of them. 



There always are and, undoubtedly, always will be students who 
lack motivation; who appear to have no course interests and therefore 
would have no reason for measuring their ability or achievements. A 
DRET approach, however, is more likely to decrease than increase 

membership in this fold. Increasing the student's decision power and 
his options in program development and measurement can be 'expected 
to reduce student apathy. 

' ,Thi^- briWf-‘prW'isiB'nta'tton;;pf a'vJD^^Trfesed^'inea^ 

.many questions : ’■ Who ’ dhterm ' pferequisi^ '..andy'the ; cpf respo^ 
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interpreted? What happens to the student who is poorly motivated for 
learning, has no desire to measure his skill, has no prerequisites he 
cares to meet? The cooperation and collaboration of educators, test 
constructors, and measurement everts can undoubtedly arrive at a 
variety of acceptable answers to these and related questions. The 
major issue is whether the ;prcposal justifies an attempt to oyercome 
the problems it may entail . 

Five practical questions related to such a program are briefly 
discussed below: 

1. How would a DRET prpgram affect test construction? There would 
be an increased demand for the services of i:estcohstructprs-ra 
need for item pools from which teachers could compile reliable 
and valid measures, as well as intact tests .and subtestsi ...Test , . 

cons tructors would"; probably 'be asked , to "make .and- standardize ins tru- 



, . ments for individual .texts , units , modtilds and . courses. 

2. How, is standardization . achieved when student's are, hp." longer obliged 
,o take tests? Qne might •;'^i^ply e:^,la^..t^ 

if whli-“.developed.-,measuring i.ihstfumehts. ,'ah.d ;'ask^ ittv. ' 

■ Construction or reVisioh'.' , On : the ,, /other ■we/might stahdardiz^^ 



t o t ake t e s t s ? 
test .‘’Cl 

instruments bn the basis of those students who;chbse to use the 
instrument during the year. . . - 

3 . What happens to the 6 or 9 week tests and the report card routine? 
A multitude 

' r ^ - 

.reporting 
, begiho’ a uhit:-^ 
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Program).'* In addition to these communications, a conference 
method might be used to discuss a student’s progress, 

4- Would teacher-made tests, typically given on a daily or weekly 
basis, be outmoded? Not at all. Such tests would be made 
available for students who desire a measurement on the specified 
content. A major function of such tests or quizzes would be to 
give the student an expectation of his knowledge and ability. 



This, in turn, would serve as a guide in his eventual selection 
of a comprehensive measurement required for a more advanced course . 

5. Could tests no longer be used to measure the attainment of 

teacher-made goals (e.g., behavioral objective)? Teachers may 
find that *' testing at student request^ is a more meaningful way of r = 

assessing teaching effectiveness than is an imposedtestingprogram.- 
The DRET approach would require a: teacher to make his goals become ■ ; 

the goals of the student. Only to the extent that the teacher 
could successfully accomplish this, could. he expect students to 
share concern for the measurement of goal attainment. The proportion . . ^ 
of students who ; chose a ' :t®sh ■ ' 

might suggest the extent to which goal attainment was unsuccessful. 

Test records could play a more significant role than ever in the 
assessment of goal attainment. 

In summary, measurement in itself is a harmless activity which 



suggests a concern for progress or change . The problems we escperience 



in educational, testing probably f effect .a ^need to challenge^ pur. ^ ; 

measurement fationaie 'more than Loiir ineasurem^^^ V 



Ln rteedf^'pf tests 

vlt •'L'~ A - 'V" * Wo Wit' nW:^ ^.and •*-: the'‘accui^^ 
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of reference materials are invalid excuses for testing programs. 

. . I 

. . ■ , I 

We must establish a rationale that will justify educational tiesting 
for the test constructor , the test administrator, and most of all, the 



test taker . 
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